PHYLOGENETIC STUDIES USING PROTEIN SEQUENCES 
WITHIN THE ORDER MYRTALES! 
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ABSTRACT 


N-terminal sequences (40 amino acids) of ribulose bisphosphate carboxylase small subunit are 
presented for representatives of Myrtaceae (2 spp.), Melastomataceae (2 spp.), Combretaceae (2 spp.), 
Lythraceae (2 spp.), Thymelaeaceae (2 spp.), Onagraceae (11 spp.), Buxaceae sensu lato (2 spp.), 
Euphorbiaceae (2 spp.), and Tiliaceae (2 spp.). In minimal phylogenetic trees the two members of 
most families pair well, while, in Onagraceae, the phylogenetic tree gives a reasonable approximation 
to the expectations from taxonomy. The first four families group together as expected for families of 
Myrtales, but Onagraceae are slightly separate. Thymelaeaceae grouped with the Myrtales families. In 
order to test the hypothesis that Thymelaeaceae are related to Euphorbiales and both to Malvales, 
phylogenetic trees embracing all groups were derived. Thymelaeaceae failed to group with Malvales 
but both showed relationships to Myrtales. Within Euphorbiales and Malvales, grouping of species 


conformed well with expectations. 


A symposium on “The Order Myrtales” was 
held at the XIII International Botanical Congress 
in 1981 and, when introducing the published 
proceedings, Raven (1984) summed up the con- 
sensus opinion that this order is a clearly defined 
group. This suggests that the families that com- 
prise the Myrtales may be favorable subjects for 
a study in which a comparison is to be made 
between phylogenies derived from morphologi- 
cal, anatomical, and micromolecular character- 
istics and from an independent approach such 
as the computerized analyses of macromolecular 
sequence data. 

The most comprehensive attempt to build an 
angiosperm phylogeny from sequence data is that 
of Martin et al. (1985), which combined into a 
single analysis sequences of four proteins and one 
RNA for up to 11 families. Phylogenetic trees 
derived from a single macromolecule did not 
agree with each other and consistency was main- 
tained only when a fourth macromolecule was 
added to the combined sequences of three pro- 
teins. The explanation advanced for this result 
was that, for four of the macromolecules (the 
exception was the small subunit of ribulose bis- 
phosphate carboxylase, henceforth RBC-SSU), a 
family was often (17 out of 33 cases) represented 
by only a single sequence. Martin et al. (1985) 
suggested that errors were less likely if a family 
node was derived from more than one member 


ofa family. The present investigation was carried 
out to test this idea that replication is effective, 
a measure of the correctness of a phylogeny being 
available in Myrtalean taxonomy. However, in 
our laboratory it is practicable to sequence only 
RBC-SSU and there are no sequences of other 
macromolecules available for the Myrtales. 
Therefore the test is limited to what is achievable 
with only one sequence. It can be claimed that, 
with more sorts of sequence data, greater pre- 
cision would follow. 


MATERIALS AND METHODS 


The 27 species that have been investigated are 
listed by family in Table 1. Methods are set out 
in detail in Martin and Jennings (1983) and the 
following is only a very brief account. One 
hundred grams of fresh leaves were treated using 
the “pungent-leaf method.” After maceration, 
species of Onagraceae were usually very muci- 
laginous so that more than the usual amount of 
extracting buffer was necessary to prevent solidi- 
fication before gel-chromatography. After puri- 
fication of the “‘fraction 1” protein, S-carboxy- 
methylation, and separation of the subunits, five 
milligrams of small subunit were sequenced on 
the Beckman Automatic Sequencer 890C. No 
attempt was made to progress beyond the first 
40 amino acids. Amino acids were identified us- 


' We acknowledge with thanks assistance from the Missouri Botanical Garden and a grant from the Australian 
Research Grants Scheme; grants to Peter H. Raven from the United States National Science Foundation, most 
recently BSR82-148, have also supported a portion of this work. We thank Richard Norrish for technical 
assistance and those who have supplied us with leaves, especially the Missouri Botanical Garden and the Adelaide 
Botanic Gardens. Peter Raven has given us great encouragement and valuable discussion. 

? Department of Botany, University of Adelaide, Box 498, G.P.O., Adelaide, South Australia 5001. 


ANN. MissouR! Bot. GARD. 73: 442-448. 1986. 


1986] 


TABLE 1. Species studied. ABG and MBG indicate 
that the leaves were collected from plants growing in 
the botanical gardens at Adelaide (ABG) and St. Louis, 
Missouri (MBG), respectively. AD indicates that leaves 
were collected elsewhere and a voucher specimen has 
been lodged in herbarium AD. For two species col- 
lected in the field there were only enough leaves for 
processing, not for voucher specimens. 


Myrtaceae 
Eucalyptus microcarpa (Maiden) 

Maiden AD 

Acmena smithii Poit. ABG 
Melastomataceae 

Melastoma affine D. Don. 

Tibouchina semidecandra Cogn. ABG 
Combretaceae 

Quisqualis indica L. MBG 

Combretum decandrum Roxb. ABG 
Lythraceae 

Lythrum salicaria L. ABG 

Woodfordia fruticosa (L.) Kurz ABG 
Thymelaeaceae 

Daphne odora Thunb. ABG 

Pimelea physodes Hook. 
Onagraceae 

Ludwigia peploides (Kunth) Raven AD 

Lopezia semeiandra (Plitmann) 

Raven & Breedlove MBG 
Circaea cordata Royle MBG 
Fuchsia hybrida Voss AD 
Epilobium ciliatum Raf. AD 
Epilobium canum (Greene) Raven ABG 
Hauya elegans DC. MBG 
Clarkia unguiculata Lindl. AD 
Clarkia rubicunda (Lindl.) Lewis & 

Lewis AD 
Gaura lindheimeri Engelm. & Gray ABG 
Oenothera stricta Ledeb. AD 

Buxaceae sensu lato 
Simmondsia chinensis (Link) 

Schneid. ABG 

Buxus sempervirens L. ABG 
Euphorbiaceae 

Acalypha wilkesiana Muell. Arg. ABG 

Ricinus communis L. ABG 
Tiliaceae 

Grewia occidentalis L. ABG 

Sparmannia africana L.f. ABG 


ing high pressure liquid chromatography and thin- 
layer chromatography. Sequence data were ana- 
lyzed by computer methods that have been de- 
scribed by Martin et al. (1985). Briefly the amino 
acid sequences were converted to inferred nu- 
cleotide sequences from which differences be- 
tween species were derived. The program esti- 
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FIGURE 1. Phylogenetic tree for 11 species of Ona- 
graceae. For specific names see Table 1. The two equal- 
ly probable consensus trees differed in that the branches 
carrying Gaura and Hauya were interchanged. Note 
that in these trees angles are meaningless. 


mates the lengths of all possible Steiner trees and 
the minimal phylogenetic tree is deemed the most 
probable. Consensus trees are derived from the 
shortest trees and this procedure is especially im- 
portant when more than one tree of minimal 
length occurs. Thus the method is essentially one 
of maximum parsimony. It should be noted that 
the number of possible trees increases exponen- 
tially with the number of taxa and, with most 
computers, usually only 11 taxa can be analyzed 
simultaneously. For taxa that group together 
constantly no matter what others are present, 
sequences of their common node can be derived 
and used in other analyses, so making the tree- 
building process more practical. 


RESULTS 


General. N-terminal sequences (40 amino 
acids) are listed in Table 2. More than 100 other 
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TABLE 2. N-terminal sequences of RBC-SSU. For other specific names and also names of families see Table 
1. A, alanine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, glycine; I, isoleucine; K, lysine; L, leucine; 
M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; S, serine; T, threonine; V, valine; W, 


tryptophan; Y, tyrosine. 
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species of angiosperms have now been analyzed, 
from a wide range of families, and so it is possible 
to say that the range of variation exhibited in 
Table 2 is quite normal in all respects except one. 
The occurrence of phenylalanine and asparagine 
at positions one and two in most species of Ona- 
graceae is unknown elsewhere. 

Onagraceae. In Raven’s (1979) classification 
of Onagraceae, Zauschneria is combined with 
Epilobium and Godetia is combined with Clark- 
ia. The results support this taxonomy; prelimi- 
nary analyses on these sequences showed that 
Epilobium canum (formerly Zauschneria cali- 
fornica) grouped closely and consistently with 
Epilobium ciliatum. Similarly Clarkia rubicunda 
(formerly Godetia amoena) grouped consistently 
with Clarkia unguiculata. Considering these re- 
sults, and the taxonomic position, we derived 
the sequences at generic nodes for Epilobium and 
Clarkia and used these in subsequent analyses. 
This reduced the number of taxa from 11 to nine 
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and for these there were 57 trees of minimal 
length. This situation arose because there were 
two adjacent internodes of very short and equal 
length so that many different combinations were 
possible without change of overall tree length. 
For similar reasons there were two consensus 
trees. We have rationalized the situation and in 
Figure | have presented a single tree with a circle 
of uncertainty at the relevant node and showing 
the difference between the two consensus trees. 
According to Raven (1979) there are seven tribes; 
Onagreae (represented here by Clarkia, Oeno- 
thera, and Gaura), Epilobieae, and Hauyeae arise 
from this circle of uncertainty. The remaining 
four taxa in our analysis represent four mono- 
generic tribes and the analysis shows them as 
separate. The point at which other families join 
into this tree (see below) is taken to be the root 
of the Onagracean tree and this makes the first 
divergence between Ludwigia and the rest. This 
is in part because Ludwigia is the only species 
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TABLE 2. Continued. 
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that we have sampled from the family that has 
a conventional N-terminus, the other ten species 
having the highly distinctive Phe-Asn, which 
alone accounts for three nucleotide differences. 

Four Myrtalean families and Thymelae- 
aceae. In this study four families which, as dis- 
cussed above, are agreed to be members of the 
order Myrtales, were chosen and, from each, two 
species, each from a different genus, were se- 
lected. Dahlgren and Thorne (1984) considered 
that, though closely related, Thymelaeaceae was 
not a Myrtalean family, so it was chosen as an 
“out-group” (to define the base of the tree) and 
two species were studied. When these ten species 
were analyzed simultaneously, there were two 
consensus trees out of 17 minimal trees (Fig. 2). 
In one tree the species pair perfectly into the five 
families, whereas in the other two species of Ly- 
thraceae lie side by side. Sequences at the five 
familial nodes shown in Figure 2 were derived 
and used in an analysis with Ludwigia, Lopezia, 


and Circaea and other external angiosperm nodes 
(see below) to give the familial node for Ona- 
graceae shown in Figure 1. The grouping of the 
six familial nodes is shown, with other infor- 
mation, in Figure 3; five of the families arise from 
a single junction but Onagraceae diverge sepa- 
rately though close to the five. The surprising 
result in this tree is that Thymelaeaceae appear 
to be a very definite member of the same group 
as most of the Myrtalean families and not, as 
had been expected, an “‘out-group.” 
Euphorbiales. In one of the currently respect- 
ed angiosperm taxonomies, Cronquist (1981) did 
indeed place Thymelaeaceae in Myrtales, al- 
though he did so because the family fits in there 
more easily than anywhere else, not because he 
considered it a clear-cut Myrtalean family (Cron- 
quist, 1984). Dahlgren and Thorne (1984) stated 
that a number of taxonomists believe that Thy- 
melaeaceae approach most closely Euphorbiales 
and they themselves considered that Thymelae- 
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FIıGURE 2. Unrooted phylogenetic tree for two 


species from each of five families. The members of a 
family pair perfectly in one consensus tree but in the 
equally probable second tree the two members of Ly- 
thraceae are side by side. For species names see Table 
1. Note that in these trees angles are meaningless. Five 
of the familial nodes used in deriving Figure 3 were 
obtained from this tree. 


aceae should be placed near Euphorbiaceae, and 
that these two families are related to Malvales. 
It therefore seemed appropriate to investigate 
Euphorbiales and Malvales. 

To represent Euphorbiales, two species were 
chosen from Euphorbiaceae and two from Bux- 
aceae, although it was understood that one of the 
latter, Simmondsia, is often placed in a family 
of its own. The four species were first analyzed 
in conjunction with the Myrtalean familial nodes 
mentioned above; all four diverged from a com- 
mon node, confirming the reality of the order 
Euphorbiales with the two Euphorbiaceae ap- 
propriately grouped. However, the two putative 
Buxaceae did not have a common familial node 
suggesting that the separation of Buxaceae and 
Simmondsiaceae might indeed be valid. 

In the Malvales, three species of Malvaceae 
had already been studied (Martin & Dowd, 1984) 
so, in order to broaden the representation, two 
species of Tiliaceae were sequenced. These were 
analyzed with the Malvaceae species and from 
the minimal tree (Fig. 4) both familial nodes were 
derived. 

The analysis of the familial nodes for the five 
families of Myrtales, Thymelaeaceae, Euphor- 
biales, and Malvales was carried out with nodes 
for two other “external” families, Fabaceae and 
Brassicaceae (Martin et al., 1983) (Fig. 3). Thy- 
melaeaceae do not move from their position in 
the Myrtales and Onagraceae still diverge slightly 
separate from most Myrtales. The major surprise 
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FiGure 3. Unrooted minimal phylogenetic tree for 
ten familial nodes and four species from Euphorbiales. 
The familial nodes were derived from the analyses rep- 
resented in Figures 1, 2, and 4. 


is that Euphorbiales diverge from the Myrtalean 
tree. We have carried out similar analyses sub- 
stituting other families for the two “external” 
ones without changing this result. Thus, except 
for the junction with the Euphorbiales and Thy- 
melaeaceae, four of the Myrtalean families (Ly- 
thraceae, Combretaceae, Melastomataceae, and 
Myrtaceae) form a natural group as would be 
expected for an order like Myrtales. 


DISCUSSION 


As an exercise for testing the ability of objec- 
tively-generated phylogenies derived from se- 
quence data to reflect the conclusions of taxon- 
omy, this must be regarded as at least a partial 
success. Most notable is the correct grouping of 
species into families as displayed in Figures 2, 3, 
and 4. In the more extensive study of Onagra- 
ceae, two successes can be recorded; the correct 
grouping of two species each into the genera 
Clarkia and Epilobium; and the identification of 
the first divergence, which confirms several other 
lines of strong evidence that Ludwigia is the “‘sis- 
ter group” ofall other Onagraceae (Raven, 1979). 
The separation of the taxa into the other six tribes 
(Raven, 1979) is more uncertain and three of 
them (Onagreae, Hauyeae, and Epilobieae) de- 
part from the tree so closely that they are effec- 
tively indistinguishable; according to Raven 
(1979) there is a possible relationship between 
Hauyeae and Onagreae. The other three tribes 
(Lopezieae, Circaeeae, and Fuchsieae), which are 
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FIGURE 4. Unrooted minimal phylogenetic tree for 
species of Malvaceae and Tiliaceae. The familial nodes 
for these two families, used in Figure 3, were derived 
from this tree. 


all monogeneric, are not confused but Fuchsia is 
thought to be the most generalized and might 
have been expected to diverge earliest. These are 
rather stringent tests of our methods because, 
when relationships are close and differences are 
small, chance is more likely to operate to give a 
misleading result; for this reason, Martin et al. 
(1983) made a strong point of deriving familial 
nodes from which to generate inter-familial phy- 
logenies, the primary purpose of this research. 

When considering the comparison of our phy- 
logenetic trees with those derived for the Myr- 
tales from taxonomy, the best source of com- 
parison appears to be four trees derived from 
Johnson and Briggs (1984). These authors do not 
appear to have a clear preference for one or more 
of these so all are shown in Figure 5; only the 
families that overlap with this study are repre- 
sented, viz. Onagraceae, Lythraceae, Combre- 
taceae, Melastomataceae, and Myrtaceae. There 
appears to be enough variation present among 
these four trees to accommodate most of the dif- 
ferences in our tree. The main discrepancy is that 
in some of their trees Myrtaceae diverges near 
the base; the main similarity is that Onagraceae 
and Lythraceae usually diverge early. However, 
it must be said, without in any way implying 
criticism, that the main objective of the whole 
exercise has been partly frustrated by the lack of 
a clearly defined phylogeny to act as a model 
against which we can test our methods. 

The inclusion of Thymelaeaceae as an “‘out- 
group” for the Myrtales, as suggested by the pa- 
per of Dahlgren and Thorne (1984), did not serve 
that purpose because in our study the family 
grouped with Myrtales. This was indeed the 
opinion of Cronquist (1984), so we were able to 
perform a test, in the best scientific tradition, to 
distinguish between the hypothesis of Cronquist 
and the alternative one of Dahlgren and Thorne 
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FIGURE 5. A comparison of phylogenies from 
Johnson and Briggs (1984) (a, b, d, and e) with equiv- 
alent ones from this paper (c and f). a and b are from 
Johnson and Briggs’ phylograms A and C, respectively, 
and d and e are congruence cladograms (not to scale) 
from their figures 6 and 7. MRT, Myrtaceae; MLS, 
Melastomataceae; CMB, Combretaceae; LYT, Lythra- 
ceae; ONA, Onagraceae. 


(1984), that of a relationship of Thymelaeaceae 
to Euphorbiales and Malvales. The results of our 
test supported Cronquist’s hypothesis in that 
Thymelaeaceae did group with Myrtales, but this 
result should be considered in the light of the 
fact that there are sequence data for less than 
15% of dicotyledonous families; given tests 
against a much wider range of variation, the fam- 
ily may well group elsewhere. We believe that 
the fact that Thymelaeaceae and Euphorbiales 
do not group with Malvales is stronger evidence 
against the hypothesis of Dahlgren and Thorne 
(1984). 

In another respect the test has been partly suc- 
cessful because the families of Myrtales do form 
a natural group separate from others, always ex- 
cepting Euphorbiales and, possibly, Thymelae- 
aceae. It is true that Onagraceae seem to diverge 
separately from the rest but additional data could 
conceivably change this. It must be acknowl- 
edged that Hutchinson (1973) placed Onagraceae 
in an order of their own but this seems to be 
quite contrary to the great majority of evidence 
produced in the symposium at the XIII Inter- 
national Botanical Congress (Raven, 1984). 
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